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Abstract 


During  the  first  half  of  the  contract  year,  the  program  continued 
the  following  studies:  speech  analysis  by  linear  prediction,  digital 
frequency  warping,  development  of  a  high  speed  digital  processor  for 
speech  synthesis,  and  the  design  of  two-dimensional  recursive  digital 
filters.  These  projects  are  summarized  in  the  following  pages  and 
reprints  of  available  publications  are  appended. 
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1.  Speech  Analysis  by  Linear  Prediction 

The  general  direction  of  this  research  is  to  study  in  detail, 

and  to  summarize  in  quantitave  terms,  the  acoustic  properties  of  pre¬ 
stressed  consonants  of  General  American  English. 

We  have  chosen  to  study  the  acoustic  properties  of  these  consonants 
under  a  highly  controlled  environment,  where  syntactic  and  semantic 
influences  are  constrained  to  a  maximal  degree  and  where  the  phonetic 
environment  can  be  varied  in  a  systematic  way.  We  feel  very  strongly 
that,  in  order  to  study  the  acoustic  properties  of  phonetic  units  and 
find  the  relationships  between  these  acoustic  properties  and  their  under¬ 
lying  invariant  attributes,  the  construction  of  such  a  controlled  data-base 
is  essential 

We  have  restricted  ourselves  to  study  only  pre-stressed  consonants 
for  several  reasons.  First,  a  stressed  consonant- vowel  sequence  seems 
to  be  universal  among  all  languages.  Studying  pre-stressed  consonant 
might  provide  a  common  ground  where  similar  studies  of  other  languages 
could  be  attempted  and  compared.  Secondly,  stressed  syllables  in  an 
utterance  are  probably  articulated  with  greater  care  and  effort,  thereby 
resulting  in  a  robust  acoustic  sigral  where  the  extraction  of  acoustic 
parameters  is  more  reliable.  It  might  even  be  hypothesized  that  the 
acoustic  properties  of  consonants  are  least  distorted  by  the  environment 
when  they  appear  in  stressed  consonant-vcwel  (C-V)  syllables.  Acoustic 
invariants  of  consonants  are  more  likely  to  be  observed  in  such  environ¬ 
ment.  Therefore,  this  phonetic  environment  might  provide,  in  some  sense 
the  clearest  indication  of  the  ideal  relationship  between  the  underlying 

invariant  attributes  and  their  acoustic  correlates, 

.  .  s 

It  is  hoped  that  studies  of  this  type  will  lead  to  a  better  under¬ 
standing  of  the  nature  of  language.  This  study  also  has  iim. adiate  appli 
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cation  in  the  recognition  of  speech  by  machine.  Since  stressed  syllables 
seem  tc  provide  more  reliable  acoustic  information,  it  may  be  hypothesized 
that  one  should  first  exercise  considerable  effort  in  the  phonetic 
recognition  of  stressed  vowels  and  consonants.  One  would  also  expect 
algorithms  proposed  for  continuous  speech  to  perform  quite  reliably  in 
such  an  environment. 

The  acoustic  analysis  system  has  been  implemented  on  the  FDP-Univac 
computer  facility  at  Lincoln  Lab.  The  system  can  generate  linear  predic¬ 
tion  spectra  as  well  as  various  acoustic  parameters.  Spectra  and  para¬ 
meters  are  computed  every  5  msec  and  are  available  for  on-line  display 
and  examination.  They  can  also  be  stored  on  magnetic  tapes  for  later 
use. 

Several  considerations  have  been  weighted  in  the  construction  of 
the  data-base. 

1)  Our  interest  is  in  studying  stressed,  C-V  syllables  in  a 
controlled  phonetic  environment. 

2)  We  would  like  to  study  them  as  they  would  appear  in  continuous 
speech. 

3)  We  would  like  to  eliminate, as  much  as  possible,  the  linguistic 
influences. 

A)  It  was  also  decided  that  a  final  consonant  should  be  added  to 
the  syllables,  since  certain  English  vowels  do  not  appear  in  the  final 
position  of  a  word. 

5)  The  data-base  should  be  general  enough  so  that  later  studies  of 
vowels  and  post-stressed  consonants  can  be  carried  out  on  the  same 
speech  material. 

6)  Finally,  talker-dependent  phenomena,  as  well  as  variations  within 
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a  given  talker,  should  also  be  accounted  for. 

The  format  of  the  speech  material  was  finally  decided  to  be  a  non¬ 
sense  word  hS'C^W^  embedded  in  a  carrier  sentence,  "Say  —  again," 

The  size  of  the  data-base  presents  a  non-trivial  problem  in  terms  of 
data  storage,  A  rough  calculation  indicates  that  if  we  are  to  include 
all  C-V  combinations,  a  mere  10  repetitions  of  each  combination  will 
result  in  some  seventy-four  thousand  utterances  or  approximately  thirty 
hours  of  speech  material.  We  are  in  the  process  of  recording  these 
utterances,  excluding  only  those  stressed,  C-V  combinations  that  do 
not  occur  in  English.  However,  we  only  intend  to  study  a  realistic 
but  substantial  subset  of  the  utterances  in  detail. 

It  was  decided  that  we  will  use  the  Univac-FDP  system  for  signal 
processing  and  use  the  TX-2  computer  for  storage  and  on-line  data 
analysis.  An  interactive  display  and  data  retrieval  system  is  presently 
under  development  on  the  TX-2.  The  system,  when  completed,  will  be 
able  to  display  acoustic  information  in  various  forms.  Collection  of 
statistics  over  a  large  corpus  of  data  can  also  be  done  with  relative 
ease. 

Preliminary  data  indicates  that  phonetic  units  have  certain  acoustic 
cues  that  are  environment-dependent  and  others  that  are  reasonably 
invariant.  It  is  well  known  that  when  speech  sounds  are  connected  to 
form  an  utterance,  there  is  a  considerable  degree  of  overlap  in  the  en¬ 
coding  process  so  that  influence  of  one  phonetic  unit  can  be  found  in 
its  adjacent  units.  We  feel  that  there  is  probably  as  much  coarticulation 
effect  on  the  consonants  as  there  is  on  vowels,  i.e,,  acoustic  properties 
of  consonants  are  quite  dependent  on  its  vowel  environment.  With  the 
availability  of  additional  data,  we'll  be  able  to  draw  more  specific 
conclusions  on  these  observations. 
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2.  Application  of  Digital  Frequency  Warping  to  Unequal  Resolution 
and  Vernier  Spectrum  Analysis 

The  application  of  a  technique,  which  has  been  referred  to  as  digital 
frequency  warping,  to  unequal  bandwidth  spectral  analysis  has  been 
investigated.  With  this  technique,  a  sequence  is  transformed  into  a 
second  sequence  in  such  a  way  that  the  Fourier  transforms  of  the  original 
and  transformed  sequences  are  related  by  a  non-linear  transformation  of 
their  frequency  axes.  An  equal  bandwidth  analysis,  carried  out  on  the 
transformed  sequence,  then  corresponds  to  an  unequal  bandwidth  analysis 
of  the  original  sequence.  In  many  spectral  analysis  applications,  it 
is  desirable  to  have  the  analysis  bandwidth  change  with  frequency.  For 
example,  in  the  analysis  of  noise  generated  by  mechanical  systems  for 
detecting  potential  failures  it  is  often  important  to  utilize  proportional 
bandwidth  or  constant  Q  analysis  so  that  the  form  of  the  spectrum  is 
invariant  under  a  time  scaling  of  the  signal,  which  might  result,  for 
example,  from  a  change  in  the  speed  of  the  system.  In  other  instances, 
such  as  sonar  data  analysis,  it  is  desirable  to  analyze  wide  bandwidth 
data  while  obtaining  high  resolution  at  tie  low  frequencies.  In  this  case, 
it  is  generally  desired  to  have  the  analysis  bandwidth  increase  with 
frequency,  but  the  exact  form  of  the  bandwidth  as  a  function  of  frequency 
is  not  crucial.  While  the  class  of  frequency  transformations,  afforded 
by  digital  frequency  warping,  does  not  permit  proportional  bandwidth 
analysis,  it  does  permit  an  analysis  in  which  the  analysis  bandwidth  in¬ 
creases  or  decreases  with  frequency  and  also  permits  a  vernier  analysis. 

A  method  for  comparing  the  bandwidth  of  the  frequency  function  afforded 
by  digital  frequency  warping,  with  a  constant  Q  analysis,  has  been  - 
developed,  and  a  detailed  study  of  the  comparison  carried  out. 
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One  of  the  potential  advantages  of  digital  frequency  warping  lies 
in  its  relatively  simple  hardware  implementation.  This  implementation 
corresponds  basically  to  a  cascade  chain  of  first-order  digital  filters 
and  thus  an  important  consideration  in  the  use  of  this  technique  is 
its  sensitivity  to  arithemetic  round-off  noise.  As  a  part  of  the  study 
of  this  technique,  an  analysis  of  the  arithemetic  round-off  effects  has 
been  carried  cut.  It  is  shown,  in  particular,  that  these  effects  are 
not  severe. 

Tliis  work  has  been  presented  in  a  paper  entitled,  "Unequal  Resolution 
and  Vernier  Spectrum  Analysis,"  by  C.  Braccini  and  A.  Oppenheim  at  the 
Florence  seminar  on  digital  filtering.  Sept.  1972,  and  also  at  the 
International  Conference  on  Signal  Processing,  University  of  Erlagen, 
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3,  Development  of  a  High  Speed  Digital  Processor 

Work  has  continued  on  the  detailed  design  of  the  Black  Box  and 

on  component  selection.  Due  to  expansion  of  the  data  word  size  to  24 
bits,  the  chip  count  is  now  about  700.  Two  of  eight  circuit  boards  have 
been  received,  and  final  design  and  layout  of  the  multiplier,  which  uses 
these  boards,  is  proceeding.  We  are  writing  a  computer  program  to  opti¬ 
mize  'he  propagation  delay  of  the  multiplier  with  respect  to  chin  count, 
since  we  estimate  about  165  MC10181  ALU  chips  will  be  needed  for  the 
24  X  24  bit  multiplier.  Despite  its  size,  the  multiplier  should  still 
operate  in  approximately  100  nanoseconds.  Considerable  effort  has  been 
expended  on  the  I/O  interface  design.  Since  the  Black  Box  data  word 
length  is  24  bits,  its  instruction  word  length  is  50  bits,  and  we  plan 
to  interface  the  machine  to  either  18  or  16  bit  host  computers,  we  have 
provided  a  flexible  I/O  system  which  is  easy  to  use  for  both  fractional 
and  integer  data,  as  well  as  iustructi ons. 
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4.  Two-Dimensional  Filter  Design 

Recent  work  on  the  problem  of  two-dimensional  recursive  filter  design 
has  been  in  the  area  of  developing  an  algorithm  which  can  be  used  to 
select  rational  functions  in  two  variables  which  approximate  a  desired 
filter  frequency  response  in  an  optimum  sense.  The  algorithm  which  has 
been  developed  is  a  s  traightforward  extension  of  a  one-dimensional 
algorithm  which  approximates  an  ideal  function  by  rational  functions. 

This  algorithm  is  the  differential  correction  algorithm.  This  is  an 
iterative  procedure  which  is  described  in  E.  W.  Cheny's  Introduction  to 
Approximation  Theory  (p.  171).  It  can  be  proven  to  yield  the  best 
approximation  to  a  given  function  in  the  Tchebycsev  (rainimax)  sense  in 
a  region.  Implementing  the  algorithm  on  a  digital  computer,  however, 
constrains  us  to  work  on  a  finite  point  set  rather  than  over  a  connected 
region.  The  number  of  points  used  is  constrained  by  the  amount  of  avail¬ 
able  computer  memory  and  computation  time, and  to  date  the  largest  net  of 
points  has  had  544  members.  This  was  observed  empirically  to  be 
sufficient  for  some  two-dimensional  design  problems,  but  not  for  all. 

The  differential  correction  algorithm  has  also  proven  to  be  a  useful 
technique  for  designing  optimum  one-dimensional  recursive  filters,  and 
is  worthy  of  further  research  in  this  context,  particularly  in  the  area 
of  weighted  approximations  where  the  prescribed  error  tolerance  from  the 
ideal  design  is  allowed  to  vary  as  a  function  of  frequency. 

In  addition,  we  have  discovered  that  if  certain  assumptions  concerning 
the  data  to  be  filtered  are  made,  a  two-dimensional  filtering  problem  may 
be  transformed  into  a  one— dimensional  filtering  problem  by  applying  the 
one-projection  theorem  of  Mersereau  [MIT,  ScD.  thesis,  1973).  In  this  case 
standard  one— dimensional  filter  design  techniques  may  be  employed  to 
solve  the  filtering  problem. 
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To  be  presented  at  Florence  Seminar  on  Digital  Filtering 
September  21  to  22,  1972 
Florence,  Italy 

Applications  of  Dlctltal  Frequency  Warping  to  unequal  Resolution 
and  Vernier  Spectrum  Analysis 

C.  Bracclnl*  and  A.  Oppenheimt 

Recently,  a  technique  referred  to  as  digital  frequency 
warping  was  proposed^^.  The  technique  corresponds  to  processing 
a  sequence  f(n)  to  obtain  a  new  sequence  g(n)  in  such  a  way  that 
the  Fourier  transforms  of  the  sequences  are  related  hv  a  distor¬ 
tion  in  the  frequency  axis.  Specifically  with  G(e^)  and  F(e^) 
denoting  the  Fourier  transforms  of  g(n)  and  f(n)  respectively  the 
frequency  variables  are  related  by 

«  *  n  +  2  CglfssYi’  a> 

The  sequence  g(n)  is  obtained  from  f(n)  by  means  of  a  cascade 
of  two  first  order  filters  and  a  chain  of  first  order  all-pass 
networks  all  with  pole  locations  at  z  «=  a .  As  the  parameter  a 
is  varied,  then  according  to  eq.  (1)  the  amount  of  frequency  warp¬ 
ing  is  varied.  Because  this  cascade  chain  of  first-order  networks 
is  defined  by  the  single  parameter  a  and  because  of  its  modular 
nature  it  leads  to  a  relatively  straightforward  hardware  realization. 

The  frequency  warning  specified  by  eq .  (1)  is  a  monotonic 
function  of  Q  and  raps  the  interval  ~ti<0<+tt  into  the  interval 
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-n$0<+ir.  V/ith  the  coefficient  a  positive,  0(0)  has  a  slope  greater 
than  unity  at  0  =  0  and  less  than  unity  at  0=tt  .  Consequently  if 
constant  bandwidth  spectral  analysis  is  carried  out  on  the  sequence 
g(n), using,  for  example,  the  Fast  Fourier  Transform  algorithm, 
it  I'fill  correspord  to  unequal  bandwidth  analysis  of  the  sequence 
f(n)  with  the  bandwidth  monotonically  increasing  with  frequency. 

If  the  relationship  between  0  and  0  were  logarithmic  then  an 
equal  bandwidth  analysis  of  g(n)  would  result  in  a  constant  Q 
analysis  of  f(n).  In  many  applications  unequal  resolution  analysis 
is  desirablev  ~  1 .  In  some  cases  the  specific  character  of  the 
resolution  as  a  function  of  frequency  Is  not  important.  In 
other  applications  such  as  when  form  Invariance  Is  desirable, 

It  is  Important  to  have  an  approximately  constant  Q  analysis. 

One  of  the  objectives  of  this  talk  Is  to  discuss  the  use  of 
digital  frequency  warping  to  approximate  constant  Q  analysis. 

Because  of  the  fact  that  the  warping  specified  by  eq.  (1) 
has  a  slope  greater  than  unity  at  0=0  It  can  be  used  to  provide 
a  vernier  spectrum  analysis.  In  particular  the  Discrete  Fourier 
Transform  of  g(n)  provides  a  sampling  of  the  spectrum  of  f(n) 
with  close  spacing  of  the  spectral  samples  around  0=0.  With  the 
parameter  a  close  to  unity,  this  can  be  viewed  as  a  vernier 
analysis  around  0=0.  In  general,  of  course,  v/e  would  like  the 
ability  to  carry  out  a  vernier  analvsis  around  an  arbitrary  fre¬ 
quency.  This  can  be  accomplished  by  multiplying  f(n)  by  a  complex 
exponential  sequence  to  modulate  the  desired  center  frequency 
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to  A  =0 ,  implementing  the  frequency  warping  and  Fourier  transform. 
Several  examples  illustrating  the  procedure  will  be  presented. 
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1.  INTRODUCTION  Ap-ict  J  -  6  19  73 

Spectral  analysis  has  traditionally  played  «n  important  role  in  the  more  general 
area  of  signal  processing.  With  the  development  of  the  Fast  Fourier  Transform 
(fFT)  algorithm  and  the  present  trend  in  digital  hardware  it  has  become  increa¬ 
singly  possible  to  carry  out  sophisticated  spectral  analysis  digitally. 


In  its  most  straightforward  application  in  digital  spectral  analysis,  the  use  of 
the  FFT  leads  to  spectrum  analysis  corresponding  to  an  analysis  carried  out  with 
a  filter  bank  with  equal  bandwidth  filters  uniformly  spaced  over  the  entire  si¬ 
gnal  band.  In  many  spectral  analysis  applications,  however,  it  is  desirable  to 
have  the  analysis  bandwidth  change  with  frequency. 

For  example  in  the  analysis  of  noise  generated  by  mechanical  systems  for  detec¬ 
ting  potential  failures,  it  is  often  important  to  utilize  proportional  band¬ 
width,  or  constant  Q  analysis,  to  obtain  form  invariance.  In  other  instances, 
it  is  desirable  to  analyze  wide  bandwidth  data  while  obtaining  high  resolution 
at  the  low  frequencies.  In  this  case,  it  is  generally  desired  to  have  the  ana¬ 
lysis  bandwidth  increase  with  frequency  but  the  exact  form  of  the  bandwidth  as 
a  function  of  frequency  is  not  crucial. 

Another  example  of  what  can  be  regarded  as  spectrum  analysis  with  an  analysis 
bandwidth  which  is  frequency  dependent  is  a  vernier  analysis.  In  this  :ase,  we 
are  interested  in  an  analysis  over  a  small  portion  of  the  band,  with  the  effec¬ 
tive  filter  spacing  much  smaller  than  the  effective  filter  widths  so  that  the 
spectrum  is  oversampled  in  frequency.  Such  as  analysis  is  often  useful,  for 
example,  when  one  is  interested  in  detecting  and  measuring,  the  center  frequency 
of  a  narrowband  component  with  a  simple  peak-picking  algorithm  on  the  spectrum. 
This  corresponds  to  very  high  spectral  sampling  in  a  part  of  the  frequency  band, 
and  none  or  very  low  spectral  sampling  in  the  remainder  of  the  band. 


In  this  paper,  the  application  of  a  technique  referred  to  as  digital  frequency 
warping  to  unequal  bandwidth  spectral  analysis  will  be  discussed.  This  techni¬ 
que  transforms  a  sequence  in  such  a  way  that  the  Fourier  transforms  of  the  ori¬ 
ginal  and  transformed  sequences  are  related  by  a  nonlinear  transformation  of  the 
frequency  axis.  An  equal  resolution  analysis  carried  out  on  the  transformed  se¬ 
quence  then  corresponds  to  an  unequal  resolution  of  the  original  sequence. 

In  the  following  discussion,  we  will  first  present  a  frame-work  for  discussing 
the  problem  of  pectral  analysis  and  indicate  how  two  Piscrete  Fourier  Transform 
is  commonly  t»|  plied.  We  ill  then- discuss  in  a  general  sense  how  unequal  '  ni- 
vidth  spectra!  .analysis  c-n  be  implemented  by  first  implementing  a  nonline 
distortion  of  the  frcqi  ney  axis,  followed  by  n  equal  bandwidth  analysis. 
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2.  GENERAL  SPECTRAL  ANALYSIS 

Let  f(n)  denote  a  sequence  of  data  and  F(fl)  its  Fourier  transform.  A  set  of  mea 
sure irents  of  the  spectrum  as  viewed  through  a  spectral  window  will  be  denoted 
by  0*  with 


I  I 


F(fl)  H(n,nk)  dQ 


1.  2. 


N.  (1) 


-H 


The  fact  that  the  spectral  window  11(11,^)  is  a  function  of  two  variables  indica¬ 
tes  that  the  shape  and  in  particular  the  width  of  the  window  can  change  with  the 
center  frequency  Kk .  When  the  spectral  window  H(Q,Dk)  depends  only  on  the  dif¬ 
ference  (Q-fifc)  so  that  eq.  (1)  becomes 


I  I 


F(n)  H(n-nk) 


dfl 


(2) 


-ir 


an  equal  bandwidth  analysis  results.  The  spectral  measurements  Gk  can  be  thought 
of  as  corresponding  qualitatively  to  filter  bank  outputs  where  each  filters  has 
the  same  spectral  shape  with  only  the  center  frequency  \  'hanging  along  the  fil¬ 
ter  bank.  Generally,  it  is  desirable  to  choose  the  lowpass  prototype  niter  cha¬ 
racteristic  H(il)  to  approximate  unity  over  a  band  of  frequencies  with  a  width 
denoted  by  B  and  zero  outside  the  band  and  to  choose  the  s  >acing  of  the  center 
frequencies  i.e.  (  fik+i  -  %  )  to  equal  the  constant  B  .-dependent  of  k. 

In  this  way,  the  band  is  covered  by  non-overlapping  filters.  In  sone  cases, 
however,  it  is  desirable  to  choose  the  spacing  of  the  center  frequencies  to  be 
much  less  than  the  filter  widths.  In  this  case,  the  filters  are  overlapping. 

Such  an  analysis  is  commonly  referred  to  as  a  vernier  analysis  and  as  mentioned 
in  the  introduction,  can  be  useful  if,  for  example,  the  analysis  is  directed 
toward  the  detection  and  measurement  of  a  narrowband  component. 

If  the  spectral  analysis  corresponds  to  proportional  bandwidth  analysis  then 
equation  ( 1 )  taker,  the  form 

w 


F(n,  h(  )  dn 


(3) 


— 1» 

In  this  case,  the  effective  filter  width  is  proportional  to  the  center  frequen¬ 
cy.  Here  the  effective  filters  become  wider  as  the  center  frequency  increases. 

Generally  when  data  is  analyzed  using  the  discrete  Fourier  transform,  a  finite 
length  window  v(n)  is  applied  to  the  data  uni  the  DPT  of  the  proauct  cor.pu-e^. 
The  magnitude  of  the  DFT  values  then  correspond  to  spectral  samples  .k  as  3F?ci 
fied  by  cq.  (2)  with  %  =  k  U0  .  The  Fourier  transform  of  the  data  winuov  v.rj 
corresponds  to  the  spectral  window  H(S1).  Thus,  a  direct  application  of  the  DFT 
results  in  an  equal  resolution  analysis.  A  commonly  used  proce  lure  for  approxi¬ 
mating  a  constant  ^  analysis  or  more  generally  unequal  resolution  is  to  sum  ad¬ 
jacent  frequency  bins  with  the  number  of  bins  summed  increasing  -ib.a  •  requeney . 
In  the  next  section,  we  discuss  an  alternative  -hereby  we  consider  the  general 
notion  of  distorting  whe  frequency  axis  of  the  signal  to  be  processed,  followed 
by  an  equal  resolution  analysis  of  the  result. 
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3.  UNEQUAL  BANDWIDTH  SPECTRUM  ANALYSIS 
BASED  ON  FREQUENCY  TRANSFORMATION 

The  DFT  computes  spectral  samples  whose  inaptitude  is  of  the  form 


»» 

ck  »  |  J  f(q)  H(n-kc0)  an 


(«*) 


-W 


corresponding  to  an  equal  bandwidth  analysis  of  f(n).  Let  us  consider  a  sequen¬ 
ce  f(n!  which  is  related  to  f(n)  in  such  a  way  that 


F(R)  »  F(R) 


(5) 


where  0  *  0( £1) . 

A  ^ 

An  equal  resolution  analysis  of  F(fi)  according  to  equation  (U)  leads  to  the  coef¬ 
ficients 


5k  - 

j  [  F(n)  H(n-kn0)  dn  | 

(6) 

n  *  0(d), 

5*  *  1  f 

-w 

F(0)  «(«(!!)  -»,)(  de^1)  ]  da  | 

(7) 

-h 


with  nk  «  k0(no)- 

We-'  see  now,  that  this  no  longer  has  the  form  of  an  equal  resolution  anr.ysis  but, 
except  for  the  factor  d0(fi)/dn  ,  still  retains  the  general  form  of  equation  ( 1 ) . 
Thus,  we  can  interpret  equation  (7)  as  an  unequal  resolution  analysis  of  f(n)  mo¬ 
dified  by  the  frequency  characteristic  d8/dfl.  This  spectral  weighting  can  be 
compensated  for  prior  to  the  spectral  analysis  by  means  of  an  appropriate  pre- 
emphasis,  or  can  be  taken  into  account  in  interpreting  the  spectral  coefficients. 
If  the  lowpass  prototype  filter  H(fi)  has  a  bandwidth  3,  and  if  we  assume  that 
e(fi)  is  approximately  linear  over  any  interval  of  length  B,  then  the  bandwidth 
of  h(o({1)  -kP(Ho))  is  B  divided  by  the  slope  of  d(Ci)  at  9(ft)  =  K 8 ( fig )  •  Thus, 
if  0(fl)  has  slope  monotonically  decreasing  with  H,  then  the  bandwidth  of  the 
analysis  will  monotonically  increase  with  ft. 

For  the  specific  case  in  which  we  would  like  a  constant  percentage  bandwidth  ana¬ 
lysis,  0( n)  is  of  the  form 

0(n)  '  ■  a[  In  a2  I! 
so  that  eq.  (7)  takes  the  form 


-  |  f  "(»■  £  ]  T  dn  | 


(k— D 

»2  *‘0 


(9- a) 


(9-b) 


where  :  ^ 

For  a  vernier  analysis,  we  would  choose  °(tl)  to  have  a  large  slope  in  the  fre¬ 
quency  range  for  .which  the  vernier  analyses  is  desired.  _ 

There  in  not  available  a  simple  procedure  for  modifying  a  sequence  to  obtain  a 
logarithmic  frequency  transformation  cf  the  form  or  eq.  (8).  However  procedure 
has  been  developed  for  obtaining  a  frequency  transformation  of  a  specific  form. 
In  the  next  section,  we  briefly  review  this  procedure  and  then  discuss,  its  use 
for  unequal  resolution  analysis.  As  a  measure  of  the  behavior  of  the  resolution 
as  a  function  of  frequency,  we  compare  it  with  constant  percentage  bnnuvidth. 


■HMHM 


17 


4.  FREQUENCY  WARPING  USING  AN  fllL-PASS  TRANSFORMATION 
A  procedure  for  implementing,  a  frequency  transformation  of  the  form 


t 


e(n) 


( n  -  R  )  +2  arctan 
a 


asin(R--Q) 
_  _ a 

1  -  a  cos  (ft  -  U  ) 

a 


(10) 


has  been  discussed  [’ll.  The  frequency  warping  achieved  is  restricted  to  the 
functional  form  of  eq.  (10)  and  can  only  be  varied  within  the  flexibility  affor¬ 
ded  by  the  parameters  a  and  Oq  .  A  plot  of  0(0)  for  =  0  and  several  values 
of  a  is  depicted  in  figure  1. 


FIGURE  1 . 

Frequency  warping  function 
for  several  values  of  the 
parameter  a  . 


To  implement  this  frequency  warping,  we  consider  two  sequences  f(n)  and  f(n) 
with  Fourier  transforms  related  by  p ^  j  _  ej0(fi)  j 

With  0(0)  of  the  form  of  eq .  (10),  and  assuming  that  f(n)  is  zero  for  n  <  0,  it 
has  been  shown  that  f(n)  and  f(n)  can  be  related  by 

00  -jnftn 

f(k)  =  £  e  f(n)  h  ( n )  (ll) 

n=0 

where  H  (z),  the  z-transform  of  h,  (n)  is  given  by 
k  k 


"M>  - 


( 1  -  a? ) z 


?t,-l 


(  1  -  a  z  ) 


-M2 


-1  ,  ik-1 

7.  -  a 


'  1  -  a  z 


-1  J 


1  -  a  7. 


-1 


11  "On 

this  corresponds  to  processing  f(-n)eJ  a  with  the 


k  >  0 


(  1?) 


k  =  0 

system  shown  in  figure  ?. 


\ 

a 


A/ 


J, 


T)(n) 

FIGURE  2.  All  pans  network  used  to  implement  the  frequency  warping  of  fig.  1. 
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The  sequence  l(n)  is  obtained  by  sampling  the  outputs  along  the  cascade  chain  at 

n  =  0,  i .e. 

f(n)  =  gR(0)  (13) 

In  a  practical  implementation  of  this  system,  the  sequence  f(n)  would  be  of  fini¬ 
te  length  and  it  would  often  not  be  necessary  to  reverse  the  direction  of^the 
input  sequence  since  this  does  not  affect  the  magnitude  of  F(ej^)  and  F(eJfi) 
and  only  changes  the  sign  of  the  phase  of  F(eJn). 

For  =  0  and  ji  positive,  we  no:e  from  figure  1  that  the  slope  is  monotonical- 
ly  decreasing  with  frequency  and  consequently  an  analysis  using  the  warping  given 
by  eq.  (10)  with  0a  =  0  will  have  an  analysis  bandwidth  which  increases  with  fre¬ 
quency.  With  0a  4  0,  the  analysis  bandwidth  will  be  minimum  around  Q  =  Qa  .  We 
wish  now  to  compare  the  analysis  as  provided  by  the  warping  of  eq.  (10)  with 
constant  percentage  bandwidth  analysis.  To  present  the  basis  for  the  comparison 
procedure  it  is  helpfu^  to  refer  to  figure  3,  indicating  the  manner  in  which  a 
filter  with  bandwidth  B  is  reflected  by  the  ~urve  G(.l).  The  bandwidth  of  the 
filter  applied  to  the  sequence  f(n)  is 

b  =  n2  -  nt  dh) 

and  its  center  frequency  is 

nc  =  |  (  «2  +  )  (15) 


FIGURE  3. 

The  reflection  of  a  filter 
with  bandwidth  B  by  the 
curve  0( n) . 


1 

B 

T 


The  bandvidtn  and  center  frequency  of  the  equivalent  filter  applied  to  f(n)  are 

B  =  -  fli  (1 C) 


=  ~  +  n,  ) 


The  Q  of  the  effective  filters  applied  to  the*  sequence  f’(n)  is  defined- as 

Q  =  -#  •  (IB) 


For  a  constant  percentage  bandwidth  analysis  we  require  that  Q  he  independent  of 
Qc  and  consequently  that  B  bo  proportional  to  '.c .  That,  this  is  in  fact  so  for 
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the  logarithmic  transformation 


n  *  ai  In  a2  Q 

1  ft/ai 
n  *  —  ,  e 
P2 


1  f  fll  1 

follows  in  a  straightforward  manner.  Specifically  ’  =  17  expl^TJ 

a 

ft2  .  _L  exp(  Si  )  and  n2  =  fli  +  5  .  Therefore 
^  t  .xp(^-)(exp(^)^]  ^ 

Q  '  ^  =  ^  exp(^)(exp(l)-.]”  '  2  «*(£)  -  1 


1  _ 1! 

2  s 


exp(  —  )  -  1 


Since  B  and  a  are  constants,  Q  is  a  constant.  In  the  same  manner,  we  can  compu¬ 
te  the  center  frequencies  and  bandvidths  reflected  through  the  warping  curve  of 
eq.  (10).  The  exact  computation  of  the  bandwidths  leads  to  unwieldy  ^Pr^sion 
and  we  will  instead  base  the  comparison  on  an  approximate  analysis.  .The  appro 
ximate  analysis  corresponds  to  assuming  that  0(«)  is  linear  over  an  interval 
with  width  B  in  SI  .  With  this  approximation,  B  and  B  can  be  simply  related  by 
the  slope  of  0(n)  at  Q  =  nc  and  ttc  and  nc  are  simply  related  by 

a  =  e(o  )  .  ’  (20) 

c  c 

It  has  been  demonstrated  that  for  a  =  1/2  and  0  *  B  *  the  error  due  to 

this  approximation  is  less  than  1  t  and  for  B  less  than  2w/l6  the  |rroj  is  less 
than  h  S.  With  the  assumption  that  9(fl)  is  linear  over  the  range  SI,  «l  with 
slope  0*(nc),  B  and  B  are  'simply  related  by 

3  =  b  e* (nc)  •  . 

I 

Tlius,  the  Q  of  the  effective  filters  is 

Q  =  !k  a  2s.  o'(n  )  .  <*> 

B  B  C 

For  constant  percentage  bandwidth,  Q  is  constant,  while  the  Q  given  in  eq.  (22) 
[s  a  fu^cUon'of  Rc .  To  compare  this  with  constant  Q,  we  define  the  error  rela¬ 
tive  to  a  constant  value  Qr  as 

Q(n  )  -  Q  ,  , 

,o  \  _  _  c  I—  23) 

c(V  -  Qr 

where  we  have  denoted  the  dependance  or  the  error  and  Q  on  nc .  «ith  the  error 
“pressed  in  this  way.  5r  is  of  course  arbitrary  and  the  error  is  there* ^  de¬ 
pendent  on  whet  constant  Q  value  we  choose  to  c^Pdre  .  He)  with.  Conquer.  y, 
it  is  convenient  to  display  the  erro-  in  terns  of  loG(.1  +  c ( ffc )  J 


log (  1  +  E(nc)  )  =  log  Q(«c)  -  lor.  Qr 


With  s(Hc)  presented  in  this  way  a  chance  in  < 3r  ii 

placement  of  the  error  curve.  Since,  fiom  eq.  12.1,  5  c  *  we 

1/B,  it  is  also  convenient  to  consider  ^<r  to  be  propoitional  to  /-  • 

express  Qr  as  Qr  =  C/H  .  ^ 

In  figure  I.  a  set  of  error  curves  are  displayed  for  different  values  of  aandfia 
For  each  valu«*  of  a,  the  different  numbered  curves  presented  correspond  to  a 
L  In  U  w  as  also  convenient  for  the  purpose  of  displaying  these  curves 

toidi.r"  the  vertical  position  of  each  curve  so  that  they  are  tanrent  to  r  -  0. 
Thus  each  curve  displayed  has  associated  with  it  a  different  value 
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A  table  of  the  values  of  and  C  for  each  curve  is  g’ ven  in  Table  1.  We  note 
on  these  curves  that  for  each  value  of  a,  those  curves  which  are  simple  convex 
curves  appear  to  be  quite  similar.  It  has  been  verified  computationally  that 
for  values  of  modulation  between  those  given,  linea”  interpolation  provides  a 
close  approximation.  To  discuss  how  the  error  curves  should  be  interpreted,  let 
us  consider  a  specific  example.  Let  us  assume  that  we  wish  to  investigate  the 
deviation  from  constant  Q  for  a  =  1/2.  Referring  to  the  appropriate  set  of  er¬ 
ror  curves,  wo  note  that  from  curve  3  (figure  -^.b),  corresponding  to  a  modula¬ 
tion  of  zero,  it  is  possible  to  have  a  positive  error  relative  to  Qr  =  0.90  B 
between  0  and  10  %  in  the  frequency  range  0.li2  w  <  i<  <  0.93  n  .  Alternatively 
by  shifting  the  same  curve  dovnwari,  we  see  that  it  is  possible  to  have  an  error 
between  i  10  %  relative  to  Qr  =  B  in  the  frequency  range  0.26  tt  <  0  <  1.0U  it  . 
For  a  =  3 A ;  with  Qa  =  0,  it  is  possible  to  have  an  error  between  i  10  %  in  the 
frequeny  range  0A3  n  <  0  <  1.05  17  with  Qr^=  0A2  B  .  We  note  that  for  Qr  to 
be  same  for  the  two  different  values  of  a,  B  can  be  larger  if  a  =  1/2  than  if 
a  =  3A-  Thus,  with  a  =  3A;  a  wider  bandwidth  analysis  can  be  used  On  the 
sequence  f(n). 

In  a  practical  setting,  we  may  wish  to  use  the  frequency  warping  to  implement  an 
approximation  to  a  constant  Q  analysis.  Given  the  frequency  range  to  be  analy¬ 
zed  and  the  allowable  deviation  from  constant  Q,  we  can  then  use  the  curves  of 
figure  l*  to  choose  the  parameters  a  and  Qa.  The  design  procedure  will  generally 
be  of  a  trial  and  error  kind.  The  choice  of  9a  will  generally  depend  on  the 
range  to  be  analyzed.  The  parameter  a  influences  tha  value  of  Qr  so  that  as  & 
increases  the  value  of  B  required  to  approximate  a  given  Q  will  decrease  requi¬ 
ring  a  finer  resolution  analysis  of  f(n).  At  the  same  time?' it  is  generally 
possible  to  extend  the  frequency  range  to  lower  frequencies  as  a  increases. 
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5.  NOISE  ANALYSIS  OF  DIGITAL  FREQUENCY  WARPING 

The  implementation  of  digital  frequency  warping  utilizes  a  cascade  chain  of  all¬ 
pass  networks  as  depicted  in  figure  2. 

In  the  previous  sections,  we  have  considered  the  effect  of  the  parameter  a  with 
regard  to  the  applications  to  unequal  bandwidth  analysis.  From  figure  2,  we  see 
that  as  the  parameter  a  increases  toward  unity,  the  poles  approach  the  unit  cir¬ 
cle.  Consequently  we  would  expect  that  with  finite  register  length  arithmetic 
considerations  of  roundoff  noise  will  play  an  important  role.  Thus,  this  section 
is  directed  toward  an  analysis  of  roundoff  noise  in  implementing  digital  fre¬ 
quency  warping. 

An  implementation  of  the  system  of  figure  2  in  terms  of  multipliers,  delays  and 
adders  is  shown  in  figure  5-  Ilote  that,  after  the  first  delay,  the  outputs  of 
this  system  differ  from  those  of  figure  2  by  a  factor  of  (l  -  a2).  Since  the  sys¬ 
tem  of  figure  5  implies  a  simple  hardware  structure,  it  is  the  system  of  fig.  5 
that  will  be  analyzed  with  regard  to  roundoff  noise.  The  factor  (1  -  a2)  is  easi¬ 
ly  accounted  for  in  a  number  of  ways  such  as  dividing  f(0)  by  (l-a2)  and  ap¬ 
plying  a  scale  factor  of  (1  -  a2)  in  interpreting  the  amplitude  of  the  sequen¬ 
ce  f(n) . 

FIGURE  5 . 

Representation  of  warping  network 

in  terms  of  multipliers,  delays  and  adders. 
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In  analyzing  the  roundoff  noise,  we  will  consider  only  fixed  point  arithmetic 
and  assume  that  rounding  is  applied  after  multiplication.  The  register  length 
will  be  considered  to  be  b-bits  plus  sign  so  that  the  error  due  to  rounding  is 
between  plus  anu  minus  .  We  will  assume  that  the  rounding  error  can  be 

represented  staticcally.  Thus,  white  noise  generators  can  be  inserted  after  the 
multiplies  to  account  for  roundoff  noise  as  shown  in  figure  6. 

FIGURE  6. 

Network  of  figure  5  with  roundoff  noise  sources  inserted. 
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Each  of  the.  noise  generators  ek(n)  are  assumed  to  be  uncorrelated,  white,  uni¬ 
formly  distributed  in  amplitude  between  ‘  V2*2"°  with  variance  : 

a1  «  J_.  2~2b  ^  total  noise  output  at  the  kth  tap  is  denoted  by  n^U)* 

e  12 

Let  Hk  r(z)  denote  the  transfer  function  from  the  kth  noise  source  input  to  the 
rth  output  tap  where  r  2  k.  Then,  since  we  assume  that  the  e*(n)  are  white  and 
uncorrelated 

i  * 

o2(r)  *  var(nr(n))  *  a2  ^  ^  f  lHk  ,rte^  ^  ^ 

n  -a 

It  can  be  verified  by  inspection  of  figure  6  that 
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It  is  interesting  to  note  that  the  increase  of  o£(r)  with  r  is  relatively  mild. 
If  ie  assume  that  a  is  r  it  close  to  unity  then  for  r  large  o'(r)  is  approximate¬ 
ly  pronortional  to  r.  In  addition  to  computing  the  output  noise  variance, 
however,  v*  rust  conrider  the  effect  of  limited  dynamic  range.  With  lifted 
dynamic  range  taken  into  account,  we  can  compute  a  noise  to  signal  ratio  a.  each 
output  node  in  the  network  of  figure  6. 

To  develop  an  approximate  analysis,  let  us  consider  an  input  u/i^ai-o 

gaussian  white  noise  with  variance  rf.  Then  the  signal  at  each  o  ~ 
gaussian .  With  o^r)  denoting  the  variance  of  the  output  sirnal 

node,  it  follows  that  " 

.  -»  1  f  i„  /-.)wil2  j..  (26) 
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v  ./  ?  o?(»)  >  'U'j)  •  We  note  furthurmore  that  ^(r)  is  independent 

of  r  -or  r  >  V  Thun  let  u.-  areu-r*  that  the  mxir.ur.  output  sirnal  variance  oc- 
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overflow  a  certain  percental  of  the  t’me  and  as  we  scale  the  input  ji^sal 
the  output  sipral  variance  ia  scaled  anu  >c»isequently  also  so  is  the  overflow 
rate.  It  is  generally  convenient  to  consider  the  overflow  rate  to  be, indepen¬ 
dent  of  a  in  which  case  we  scale  of  such  that"  Og(r)  is  constant,  i.e.'oj!(r)  »  C. 
In  that  case,  the  maximum  output  noise  to  sigpal  ratio  is  given  by 

„2  .  _ 
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According  to  eq.  (31),  the  noise  to  signal  ratio  increases  monotonically  with  r 
and  thus  the  largest  noise-to-sigpal  ratio  occurs  in  the  last  stage.  With  R  ♦  1 
denoting  the  total  number  of  stages,  and  assuming  that  R  is  relatively  large, 
the  noise-to-sifpal  ratio  in  the  last  stage  can  be  approximated  as 


If,  for  excunple,  we  consider  a  *  3/1*  and  R  *  29 
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In  general,  we  would  choose  the  constant  C  on  the  basis  of  the  percentage  of  over¬ 
flow  which  we  are  willing  to  permit.  For  this  example,  let  us  choose  C  so  that 
the  standard  deviation  of  the  output  is  1/U,  i.e.  C  =  l/l6.  Then 


or,  in  db,  10  logio 


f!n1  _ 
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Thus  for  example  with  b  *  17,  the  noise-to-signal  ratio  is  approximately  minus 
70  db.  If  we  choose  C  instead  so  that  the  standard  deviation  is  1/2,  then  with 
b  *  17,  the  noise-to-signal  ratio  is  minus  76  db. 

The  above  analysis  is  based  on  the  attitude  that  we  are  willing  to  permit  the 
filter  to  overflow  a  certain  percentage  of  the  time.  A  more  pessimistic  analy¬ 
sis  can  also  be  carried  out  for  which  we  scale  the  input  to  prevent  overflow 
in  the  worst  case.  To  develop  this  analysis,  let  ho,r(n)  denote  the  impulse 
response  of  the  system  from  the  input  to  the  kth  output  node  and  Sfc(n)  deno¬ 
te  the  output  at  the  kth  node. 

In  general,  tne  input  will  be  of  finite  length  N,  and  thus  we  assume  that 
x(n)  *  0  for  n  <  0  and  n  i  N  .  Then  n 


sk(n>  - 


h0,k(r)  ,<n‘r) 


and  thus 


| S.  ( n ) |  <  max(x(n))  £  |h  (r)| 

r=0  • 


We  are  interested  in  the  system  outputs  for  n  «  II  and  thus  we  iterate  the  sys¬ 
tem  N  times.  Then,  for  n  <  N,  we  write  that 

N 

| S^.( n '  |  <  max (x( n ))  V  s|hQ  k(r)|  n  «  N  .  ( 31* ] 

overn  r=0  ’ 


n  i  N 


To  prevent  overflow,  we  require  that  |^k(n)|  <  1  for  all  k,  which,  from  eq.(3*0, 
is  guaranteed  by  sealing  the  input  amplitude  so  that  ' 
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where 


max(x(n))  <  — 
over  n 

N 

nax  ^  lhOk(r)l 

over  k  r=0  * 


If  we  assume  that  the  input  is  white  noise  uniformly  distributed  in  amplitude 
between  plus  and  minus  1/M,  then  a2  =  1/3M2  and  the  variance  of  the  output  si¬ 
gnal  Sk(n)  is 

°s{0)  =  W  (36..) 


02/k)  =  _L_  a2(_1  1  a2) 

Vk'  3M2  n 


k  S  1 
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Combining  eqs.  (36)  and  (27),  the  output  noise-to-signal  ratio  is 
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From  eqs.  (37. a)  and  (37 .b),  it  is  clear  that  the  noise-to-signal  ratio  is  a  mo- 
notonic  function  of  k  and  consequently  the  largest  noise-to-signal  ratio  occurs 
in  *he  last  stage.  Then,  with  K+  1  denoting  the  number  of  stages  in  the  sys¬ 
tem,  and  assuming  K  1 ,  the  noise-to-signal  ratio  in  the  last  stage  can  be 
approximated  as  2iv\ 

V5  .  ,  .,2  ,  <■!■--  id!2.  „2 


=  3  Mz  K 


■(l+a?)  c 


To  observe  how  the  noise-to-signal  ratio  is  influenced  by  the  parameters  K,  a 
and  M,  we  must  consider  the  influence  of  these  parameters  on  M  as  defined  by 
eq.  (35).  To  this  end,  we  consider  S(k)  defined  as 


so  that 


SOU  =  T  |hQ  (r) 

r=0 

M  =  max  (s(k)) 
ove  r  k 


Figure  7  shows  a  plot  of  S(k)  for  several  different  v  llues  of  a  as  a  function 
of  II  and  k.  There  are  a  number  of  trends  evident  from  these  plots.  First,  we 
note  that  for  a  given  II  and  _a,  S(k)  reaches  a  maximum  and  then  decreases. 
Furthermore,  this  maximum  is  reached  for  x  small  compared  with  II,  and  the  value 
of  k  for  which  S(k)  readies  a  maximum  decreases  as  a  increases.  Consequently, 
it  is  reasonable  to  consider  M  to  be  independent  of  K  in  eq.  (33).  Thus,  from 
eq.  (38),  the  noise-to-signal  ratio  does  not  increase  rapidly  with  K,  i.e.  it 
is  relatively  insensitive  to  the  number  of  stages. 


[1]  A. I/.  OPP FVI/EIM  and  V.ll.  JOHNSON  "Discrete  Representation  of  Signals" 
Proceedings  of  the  T.E.K.K.,  60  n°  6,  pp.  681-691  (june  1972). 
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1.  INTRODUCTION 


Over  the  past  five  to  ten  years,  digital  signal  processing  has  been  a  growing 
field  in  terms  of  the  development  of  new  techniques  and  hardware  and  their  appli¬ 
cation  to  a  broad  class  of  problems  [lj.  Speech  processing  has  traditionally 
been  an  important  area  of  application  for  digital  signal  processing  and  the 
growing  importance  of  digital  signal  processing  techniques  to  speech  processing 
is  evident  £21.  Similarly  these  techniques  are  increasingly  important  in  seismic 
data  analysis,  mechanical  vibration  analysis,  biomedical  signal  processing,  radar 
and  sonar  systems  and  picture  processing.  This  growth  has  resulted  partly  from 
a  succession  of  theoretical  developments  such  as  the  Fast  Fourier  Transform  ( FFT) 
algorithm  [3],  cepstral  analysis  and  homomorphic  filtering  [4,5],  digital  filter 
design,  signal  analysis  by  linear  prediction  [61  and  a  long  list  of  others.  In 
addition  the  developments  in  integrated  circuit  technology  have  led  to  the  imple¬ 
mentation  of  increasingly  sophisticated  special  purpose  digital,  signal  processing 
hardware  [7]  and  general  purpose  digital  signal  processing  computers  [8]. 


2.  DIGITAL  SIGNAL  PROCESSING  ACTIVITIES  AT  M.I.T. 


Digital  signal  processing  is  an  active  research  area  at  M.I.T.  There  is  a  dipi- 
tal  signal  processing  research  proup  in  the  Research  Laboratory  of  Electronics 
and  another  at  Lincoln  Laboratory.  In  addition  there  are  a  number  of  research 
groups  which  are  not  directly  involved  in  research  on  dipital  signal  processing 
but  rely  heavily  on  dipital  signal  processing  techniques  to  pursue  their  main 
.  research  interests.  There  is  generally  considerable  interaction  among  all  of 
.  these  groups.  In  this  talk  1  will  not  try  to  be  comprehensive  in  describing  the 
research  projects  in  progress,  and  for  the  most  part  will  confine  my  comments  to 
a  few  research  areas  presently  being  investigated  by  ny  group  in  the  Research 
Laboratory  of  Electronics.  Hope  fully  these  cv.ments  will  suggest  the  flavor  of 
some  of  our  digital  signal  processing  activities  at  M.I.T., 
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The  general  scope  of  our  activities  includes  the  investigation  of  a  number  of 
theoretical  issues  and  new  digital  signal  processing  techniques  and  the  applica¬ 
tion  of  these  techniques  to  speech  and  picture  processing.  As  an  example,  over 
the  past  several  years  ve  have  been  interested  in  models  for  representing  arith¬ 
metic  errors  due  to  finite  word  length  arithmetic  [91.  More  recently  ve  have 
been  interested  in  the  analysis  and  development  of  digital  filter  structures  with 
law  sensitivity  to  finite  register  length  effects  [10l.  Our  approach  is  to  try 
to  develop  and  utilize  a  network  theory  to  obtain  nev  filter  structures.  Follo¬ 
wing  the  approach  taken  by  Fettveis  [ill  ve  have  developed  a  number  of  netvork 
properties  of  digital  filtei  including  some  large-scale  sensitivity  formulas  [12]. 
In  addition  ve  are  investigating  a  number  of  computational  methods  for  sensitivi¬ 
ty  analysis  of  digital  filters  [13]. 

Another  project  has  been  directed  toward  the  discrete  representation  of  continu¬ 
ous-time  signals  [ml.  This  work  has  led  among  other  things  to  a  notion  referred 
to  as  digital  frequency  warping  which  has  potential  application  to  unequal  reso¬ 
lution  and  vernier  spectrum  analysis  as  discussed  in  detail  in  another  paper  at 
this  conference  [  1 5 1 .  We  are  presently  exploring  also  the  application  of  this 
technique*  to  some  speech  processing  problems. 

In  speech  processing  our  interest  is  primarily  in  the  application  of  digital  si¬ 
gnal  processing  techniques  for  bandwidth  compression  and  coding  and  for  parameter 
e:  (tract  ion  for  speech  recognition,  speaker  verification  etc.  Much  of  our  approach 
to  speech  processing  has  been  based  on  homomorphic  analysis  of  speech  [5].  Re¬ 
cently  ve  have  been  investigating  and  utilizing  the  techniques  of  linear  predic¬ 
tion  for  speech  analysis  [6, 16, 17 J. 

A  different  class  of  problems  which  are  becoming  increasingly  important  and  inte¬ 
resting  are  those  involving  multi-dimensional  digital  signal  processing.  In  ima¬ 
ge  processing  Tor  example,  digital  filtering  is  important  in  such  problems  as  re¬ 
ducing  noise,  inverse  filtering,  matched  filtering  for  pattern  recognition,  con¬ 
trast  enhancement  and  dynamic  range  compression,  etc.  C 1 8 1 .  In  contrast  with  one- 
dimensional  filters  however,  the  problem  of  desirn  of  multi-dimensional  digital 
filters  is  for  the  most  part  unresolved.  The  difficulties  in  multi-dimensional 
filter  design  arise  from  the  fact  that  the  system  function  is  a  ratio  of  multi¬ 
dimensional  polynomials.  Factorization  theorems  for  multi-dimensional  polyno¬ 
mials  do  not  exist  as  they  do  for  one  dimensional  oolynonials  and  consequently 
multi-dimensional  system  functions  cannot  be  simply  characterized  in  terms  of 
poles  and  zeros.  Thus  stability  is  considerably  harder  to  guarantee.  Furthermo¬ 
re  in  one  dimension,  the  approximation  problem  is  generally  based  on  the  design 
of  a  squared  magnitude  function  which  is  then  factored.  In  multi-dimensions, 
it  is  not  generally  known  how  to  specify  a  squared  magnitude  function.  Further¬ 
more,  the  genera]  theory  of  approximation  by  multi-dinensional  polynomials  is  con¬ 
siderably  less  developed.  Thus  the  general  area  of  multi-dimensional  digital  fil¬ 
ter  design  presents  a  number  of  interesting  research  problems  which  we  are  pre¬ 
sently  pursuing. 

Another  project  involving  multi-dimensional  sirnal  processing  is  concerned  with 
the  digital  reconstruction  of  multi-dimensional  signals  from  their  projections 
[19,20 J.  In  many  applications  a  set  of  projections  of  an  U-dir.ensional  object 
into  ( M—  1 )  dimensions  ar°  available  from  which  it  is  useful  to  reconstruct  the 
original  object.  X-ray  photographs  for  example  represent  two  dimensional  projec¬ 
tions  of  the  three  dimensional  object  which  has  been  X-rayed. 

v 

The  basis  Tor  most  of  the  algorithms  which  perform  reconstructions  is  a  theorem 
which  is  referred  to  as  the  projection-slice  theorem.  The  theorem  states  that 
the  projection  at  any  orientation  o**  an  !I-dinensionaI  pbjcct  has  a  Icurier  trans¬ 
form  which  is  an  (ll-t;  dimensional  slice  at  the  r-atre  orientation  of  the  N-dimen- 
siori'i]  Fourier  transform  of  the  object.  On  the  basis  of  tnis.  each  projection 
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provides  a  dice  of  the  Fourier  transform  and  if  enough  slices  are  obtained  th- 
Fourier  transform  and  hence  the  original  object  can  be  reconstructed. 

Based  on  formulating  the  problem  in  digital  signal  processing  terms,  a  number  of 
new  algorithms  have  been  developed  which  appear,  based  on  several  reconstruction 
examples ,  to  he  superior  to  previous  algorithms.  Furthermore  a  number  of  new 
theoretical  resuLts  have  been  developed  relating  to  the  minimum  number  of  projec¬ 
tions  necessary  for  exact  reconstruction.  It  has  been  shown  in  particular  that 
taken  at  the  right  orientation  exact  reconstruction  can  theoretically  be  carried 
out  from  only  a  sinrle  projection.  While  reconstruction  based  on  this  theorem  is 
too  sensitive  to  apply  in  practice  the  theorem  appears  to  have  potential  applica¬ 
tion  to  multi-dimensional  filter  design.  Seme  of  the  more  practical  reconstruc¬ 
tion  algorithms  have  been  applied  to  the  reconstruction  of  a  section  of  a  leg 
bone  from  real  X-ray  data. 

The  above  comments  complete  the  brief  survey  of  some  of  our  digital  signal  pro¬ 
cessing  activities.  I  would  like  to  conclude  with  a  short  description  of  the 
computer  facilities  which  we  use. 


3.  FACILITIES 


There  are  two  primary  facilities  that  we  have  available  to  us  :  a  Digital 
Equipment  Corporation  PDF. 9  computer  at  the  Research  Laboratory  of  Electronics 
and  a  high  speed  signal  processing  computer,  the  Fast  Digital  Processor  ( FDP)  at 
Lincoln  Laboratory  [8].  The  PDP.9  computer  is  a  standard  medium  size  computer 
with  18  bit  fixed  point  arithmetic.  In  our  facility  we  have  2hK  of  core  ir  mory, 
two  250K  word  discs,  four  dec  taper  a  graphics  display,  X-Y  tablet,  disT  .ay  pro¬ 
cessor  and  point  display,  and  high  speed  printer.  In  addition  there  is  periphe¬ 
ral  equipment  interfaced  for  audio  input  and  output.  In  general  the  facility  is 
used  in  an  interactive  way. 

The  FDP  is  a  special  purpose  programmable  signal  processor  designed  and  built  at 
Lincoln  Laboratory  and  is  interfaced  to  a  Univac  1219  computer.  It  utilizes  a 
ligh  speed  memory  and  operates  with  a  memory  cycle  time  of  250  nanoseconds.  It 
has  a  hignly  parallel  configuration  with  four  arithmetic  elements  which  communi¬ 
cate  with  each  other  and  operate  in  parallel.  In  general  it  can  implement  signal 
processing,  algorithms  on  the  order  of  50-100  times  faster  than  the  PDP.9.  For 
example,  the  implementation  of  an  FFT  with  256  complex  or  ^12  real  points  requi¬ 
res  on  the  order  of  2  nsec.  This  corresponds  to  real  time  spectral  analysis  at  a 
100  -  200  KHZ  input  sampling  rate.  A  two-dimensional  Fourier  transform  of  a  256 
by  256  point  picture  requires  approximately  1  second.  In  general  the  speed  of 
the  FDF  permits  real  time  digital  signal  processing  for  a  wide  variety  of  pro¬ 
blems.  For  many  others  such  as  picture  processing  it  offers  the  possibility  of 
highly  interactive  processing. 
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